Talend Data Quality Essentials

Subscription: This content is available for Talend Academy subscription users.

Instructor-led: This content is available as instructor-led training.

 

Talend Studio for Data Quality enables data governance teams to assess the quality of data in any data source. Talend Data Quality also lets you verify data completeness, accuracy, and integrity in preparation for data migration, instance consolidation, and data integration.

 

This learning plan is designed to help you start using Talend Studio for Data Quality right away. You learn how to evaluate data quality according to a set of metrics and thresholds based on indicators, models, and rules for each data item to be analyzed or monitored. You also use data integration Jobs for simple data cleansing tasks.

 

Duration: 2 days (14 hours)

 

Target audience: Anyone who wants to use Talend Studio for Data Quality to assess data quality

 

Prerequisites: Completion of Introduction to Talend Studio or Talend Data Integration Basics, as well as familiarity with SQL

 

Learning objectives: After completing this learning plan, you will be able to:

  • Connect to a database and run an analysis on it

  • Examine the contents of a connection to a data source

  • Create, configure, and run a column analysis

  • Generate regular expressions for pattern matching in an analysis to test data quality

  • Define indicator thresholds that are flagged in analysis results when violated

  • Create, configure, and run different types of table analysis

  • Define a SQL business rule and set up an analysis to identify rows that conflict with your rule

  • Create, configure, and run a table match analysis to search for duplicates

  • Use advanced matching to enhance identification of duplicates

  • Ensure data privacy by shuffling and masking customer data
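
To make the idea of indicators, pattern matching, and thresholds more concrete, here is a minimal Python sketch of what a column analysis computes conceptually. It is not Talend code and does not reflect how Talend Studio implements its analyses; the sample values, the email pattern, the function names, and the 0.9 threshold are all assumptions chosen for illustration.

```python
import re

# Hypothetical sample column; not taken from the course material.
emails = ["ann@example.com", "bob@example", "", "carol@example.org", None]

# Assumed pattern: a deliberately simple email shape, not a full validator.
EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+(\.[\w-]+)+")


def column_indicators(values, pattern):
    """Return row, blank/null, and pattern-match counts for one column."""
    total = len(values)
    blank = sum(1 for v in values if v is None or v.strip() == "")
    matching = sum(1 for v in values if v and pattern.fullmatch(v))
    return {"rows": total, "blank": blank, "matching": matching}


def violates_threshold(indicators, min_match_ratio):
    """Flag the column when the share of matching rows is below the threshold."""
    rows = indicators["rows"]
    ratio = indicators["matching"] / rows if rows else 0.0
    return ratio < min_match_ratio


indicators = column_indicators(emails, EMAIL_PATTERN)
flagged = violates_threshold(indicators, min_match_ratio=0.9)
print(indicators, "threshold violated:", flagged)
```

In this sketch, two of the five values match the pattern, so the match ratio falls below the assumed threshold and the column is flagged.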

 

Training modules: To complete the learning plan, take the following training modules: